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Abstract 



In 1993 E. I. Zelmanov has put the following question in Dniester Notebook: Suppose 
that Fi m is a free 2-generated associative ring with the identity x m = 0. Is it true 
that the nilpotency degree of i*2,m has exponential growth? 

We give the definitive answer to E. I. Zelmanov's question showing that the 
nilpotency class of an /-generated associative algebra with the identity x d = is 
smaller than \P(cZ, d,l), where 

*(n,d,Z) = 2 18 /(nd) 31 °g3M)+ 13 ( i 2 . 

>: 

This result is a consequence of the following fact based on combinatorics of words. 
Let I, n h d ^ n be positive integers. Then all words over an alphabet of cardinality 
£S) ' I whose length is not less than ^(n, d, I) are either n-divisible or contain ci-th power 

■ of a subword; a word W is n-divisible if it can be represented in the form W = 

WqWi ■ ■ ■ W n such that W\, W2, ■ ■ ■ , W n are placed in lexicographically decreasing 
order. Our proof uses Dilworth theorem (according to V. N. Latyshev's idea). We 
show that the set of not n-divisible words over an alphabet of cardinality I has height 
h < $(n, I) over the set of words of degree ^ (n — 1), where 



*(n,0 = 2 87 Z.n 121 °S3 n+48 . 

Keywords: Shirshov theorem on height, word combinatorics, n-divisibility, Dilworth 
theorem, Burnside-type problems. 

1 Introduction 

1.1 Shirshov theorem on height 

In 1958 A. I. Shirshov has proved his famous theorem on height ([I], [2]). 
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Onpe/jejieHHe 1 A word W is called n-divisible if W can be represented in the form 
W = VU1U2 • ■ ■ u n such that u\ >- U2 >- ■ ■ ■ >- u n . 

In this case any non-identical permutation a of subwords produces a word W a = 
TO a(i) u <7(2)"'U<r(n)' which is lexicographically smaller than W. Some authors take this 
feature as the definition of n-divisibility. 

Onpe,n,ejieHHe 2 A Pl-algebra A is called an algebra of bounded height h = Hty(A) 
over a set of words Y = {u\,U2, ■ • •} if h is the minimal integer such that any word x from 
A can be represented in the form 



where {rj} don't exceed h. The set Y is called a Shirshov basis for A. If no misunder- 
standing can occur, we use h instead of Hty (A). 

Shirshov theorem on height. ([I], [2]) The set of not n-divisible words in a finitely 
generated algebra with an admissible polynomial identity has bounded height H over the 
set of words of degree not exceeding n — 1. 

The Burnside-type problems related to height theorem are considered in [3]. The au- 
thors believe that Shirshov theorem on height is a fundamental fact in word combinatorics 
independently of its applications to Pi-theory. (All our proofs are elementary and fit in the 
framework of word combinatorics.) Unfortunately the experts in combinatorics haven't 
sufficiently appraised this fact yet. As regards the notion of n-divisibility itself, it seems 
to be fundamental as well. V. N. Latyshev's estimates on £ n (&), the number of non-n- 
divisible polylinear words in k symbols, have lead to fundamental results in Pi-theory. At 
the same time, this number is nothing but the number of arrangements of integers from 1 
to k, such that no n integers (not necessarily consecutive) are placed in decreasing order. 
Furthermore it is the number of permutationally ordered sets of diameter n consisting of k 
elements. (A set is called permutationally ordered if its ordering is the intersection of two 
linear orderings, the diameter of an ordered set is the length of its maximal antichain.) 

Height theorem implies the solution of a set of problems in ring theory. Suppose 
an associative algebra over a field satisfies a polynomial identity f(xi, . . . ,x n ) = 0. It 
is possible to prove that then it satisfies an admissible polynomial identity (that is, a 
polynomial identity with coefficient 1 at some term of the highest degree): 



where a a belong to the ground field. In this case, if W — VU1U2 • ■ ■ u n is n-divisible then 
for any permutation o the word W a = vu a mu a t2) • • • u a(n) is lexicographically smaller 
than W, and thus an n-divisible word can be represented as a linear combination of 
lexicographically smaller words. Hence a Pl-algebra has a basis consisting of non-ra- 
divisible words. By Shirshov theorem on height, a Pl-algebra has bounded height. In 
particular, if a Pl-algebra satisfies x n = then it is nilpotent, that is, any its word of 
length exceeding some N is identically zero. Surveys on height theorem can be found in 
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This theorem implies the positive solution of Kurosh problem and of other Burnside- 
type problems for PI-KOJien;. Indeed, if Y is a Shirshov basis and all its elements are 
algebraic then the algebra A is finite-dimensional. Thus Shirshov theorem explicitly in- 
dicates a set of elements whose algebraicity makes the whole algebra finite-dimensional. 
This theorem implies 

Corollary 1.1 (Berele) Let A be a finitely generated PI- algebra. Then 

GK(A) < oo. 

GK(A) is the Gelfand — Kirillov dimension of the algebra A, that is, 

GK(A) = lim \nV A (n)/\n(n) 

where Va(ti) is the growth function of A, the dimension of the vector space generated by 
the words of degree not greater than n in the generators of A. 

Indeed, it suffices to observe that the number of solutions for the inequality k\\v\ \ + 
h k h \v h \ ^ n with h ^ H exceeds N H , so that GK(A) < Et(A). 

The number m = deg(A) will mean the degree of the algebra, or the minimal degree of 
an identity valid in it. The number n = Pid(A) is the complexity of A, or the maximal k 
such that Mfc, the algebra of matrices of size k, belongs to the variety Var(A) generated 
by A 

Instead of the notion of height, it is more suitable to use the close notion of essential 
height. 

Onpe,a,ejieHHe 3 An algebra A has essential height h = He S s{A) over a finite set Y called 
an s-basis for A if there exists a finite set D C A such that A is linearly representable by 
elements of the form t\ ■ . . . ■ t h where I ^ 2h + 1, Vi(ij G D V ^ = y^; yi G Y) and the set 
of i such that ti G" D contains not more than h elements. The essential height of a set of 
words is defined similarly. 

Informally speaking, any long word is a product of periodic parts and "gaskets" of 
restricted length. The essential height is the number of periodic parts, and the ordinary 
height accounts "gaskets" as well. 

The height theorem suggests the following questions: 

1. To which classes of rings the height theorem can be extended? 

2. Over which Y the algebra A has bounded height? In particular, what sets of words 
can be taken for {v $}? 

3. What is the structure of the degree vector {k\, . . . , fc^)? First of all, what sets of its 
components are essential, that is, what sets of hi can be unbounded simultaneously? 
What is the value of essential height? Is it true that the set of degree vectors has 
some regularity properties? 

4. What estimates for the height are possible? 
Let us discuss the above questions. 
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1.2 Non- associative generalizations 

The height theorem was extended to some classes of near-associative rings. S. V Pchelint- 
sev |9j has proved it for the alternative and the (—1,1) cases, S. P. Mishchenko [10J lias 
obtained an analogue of the height theorem for Lie algebras with a sparse identity. In the 
paper by one of the authors [TT] the height theorem was proved for some class of rings 
asymptotically close to associative rings. In particular, this class contains alternative and 
Jordan Pi-algebras. 

1.3 Shirshov bases 

Suppose A is a Pi-algebra and a subset M C A is its s-basis. Then if all elements of M 
are algebraic over K then A is finite-dimensional (the Kurosh problem). Boundedness of 
essential height over Y implies "the positive solution of the Kurosh problem over Y". The 
converse is less trivial. 

Theorem 4 (A. Ya. Belov) a) Suppose A is a graded Pl-algebra, Y is a finite set of 
homogeneous elements. Then if for all n the algebra A/Y^ is nilpotent then Y is an 
s-basis for A. If moreover Y generates A as an algebra then Y is a Shirshov basis for A. 

b) Suppose A is a Pl-algebra, M C A is a Kurosh subset in A. Then M is an s-basis 
for A. 

Let Y^ n > denote the ideal generated by nth powers of elements from Y . A set M C A 
is called a Kurosh set if any projection it: A <g> K[X] — > A' such that the image ir(M) is 
entire over 7r(A[X]) is finite-dimensional over tt(K[X}). The following example motivates 
this definition. Suppose A = Q[x, 1/x]. Any projection tt such that ir(x) is algebraic has 
a finite-dimensional image. However the set {x} is not an s-basis for Q[x,l/x]. Thus 
boundedness of essential height is a non- commutative generalization of the property of 
entireness. 

1.4 Shirshov bases consisting of words 

The Shirshov bases consisting of words are described by the following 

Theorem 5 ([4], [12| ) A set Y of words is a Shirshov basis for an algebra A iff for any 
word u of length not exceeding m = Pid(A), the complexity of A, the set Y contains a 
word cyclically conjugate to some power of u. 

A similar result was obtained independently by G. P. Chekanu and V. Drensky. Prob- 
lems related to local finiteness of algebras and to algebraic sets of words of degree not 
exceeding the complexity of the algebra were investigated in [JJ [131 H2 HS1 EH HZ1 HE]- 
Questions related to generalization of the independence theorem were considered in these 
papers as well. 

1.5 Essential height 

Clearly the Gelfand — Kirillov dimension is estimated by the essential height. Furthermore 
an s-basis is a Shirshov basis iff it generates A as an algebra. In the representable case 
the converse is also true. 
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Theorem 6 (A. Ya. Belov, [4J) Suppose A is a finitely generated representable algebra 
and H EssY (A) < oo. Then H EssY (A) = GK(A). 



Corollary 1.2 (V. T. Markov) The Gelfand — Kirillov dimension of a finitely gener- 
ated representable algebra is an integer. 

Corollary 1.3 If He ss y{A) < oo and A is representable then He ss y{A) is independent 
of choice of the s-basis Y . 

In this case the Gelfand — Kirillov dimension also is equal to the essential height by 
virtue of local representability of relatively free algebras. 

Structure of degree vectors. Although in the representable case the Gelfand - 
Kirillov dimension and the essential height behave well, even in this case the set of degree 
vectors may have a bad structure, namely, it can be the complement to the set of solutions 
of a system of exponential-polynomial Diophantine equations [4J . That is why there exists 
an instance of a representable algebra with the trascendent Hilbert series. However for a 
relatively free algebra, the Hilbert series is rational |19] . 

1.6 n-divisibility and Dilworth theorem 

The significance of the notion of n-divisibility exceeds the limits of Burnside-type prob- 
lems. This notion is also actual in investigation of polylinear words and estimation of 
their number; a word is polylinear if each letter occurs in it at most one time. V. N. Laty- 
shev applied Dilworth theorem for estimation of the number of not m-divisible polylinear 
words of degree n over the alphabet {ai, . . . , a n }. The estimate is (m — l) 2n and is rather 
sharp. Let us recall this theorem. 

Dilworth theorem: Let n be the maximal number of elements of an antichain in a 
given finite partially ordered set M . Then M can be divided into n disjoint chains. 

Consider a polylinear word W consisting of n letters. Put >~ a,- if i > j and the letter 
ai is located in W to the right from aj . The condition of not ^-divisibility means absence 
of an antichain consisting of n elements. Then by Dilworth theorem all positions (and the 
letters as well) split into (n — 1) chains. Attach a specific color to each chain. Then 
the coloring of positions and of letters uniquely determines the word W. Furthermore the 
number of these colorings does not exceed {n — l) k x (n — l) fc — (n — l) 2k . 

The above estimate implies validity of polylinear identities corresponding to an ir- 
reducible module whose Young diagram includes the square of size n 4 . This in turn 
enables, firstly, to obtain a transparent proof for Regev theorem which asserts that a 
tensor product of Pi-algebras is a Pi-algebra as well; secondly, to establish the existence 
of a sparse identity in the general case and of a Capelli identity in the finitely generated 
case (and thus to prove the theorem on nilpotency of the radical); and thirdly, to real- 
ize A. R. Kemer's "supertrick" that reduces the study of identities in general algebras to 
that of super-identities in finitely generated superalgebras of zero characteristic. Close 
questions are considered in (2D1 El ED]- 

Problems related to enumeration of polylinear words which are not n-divisible are 
interesting of their own. (For example, there exists a bijection between not 3-divisible 
words and Catalana numbers.) On one hand, this is a purely combinatorial problem, but 
on the other hand, it is related to the set of codimensions for the general matrix algebra. 



5 



The study of polylinear words seems to be of great importance. V. N. Latyshev (see for 
instance [22j) has stated the problem of finite-basedness of the set of leading polylinear 
words for a T-ideal with respect to taking overwords and to isotonous substitutions. This 
problem implies the Specht problem for polylinear polynomials and is closely related to 
the problem of weak Noetherian property for the group algebra of an infinite finitary 
symmetric group over a field of positive characteristic (for zero charasteristic this was 
established by A. Zalessky). To solve the Latyshev problem, it is necessary to translate 
properties of T-ideals to the language of polylinear words. In (U [11] an attempt was 
made to realize a project of translation of structure properties of algebras to the language 
of word combinatorics. Translation to the language of polylinear words is simpler and 
enables to get some information on words of a general form. 

In this paper we transfer V. N. Latyshev's technique to the non-polylinear case, 
and this enables us to obtain a subexponential estimate in Shirshov height theorem. 
G. R. Chelnokov suggested the idea of this transfer in 1996. 

1.7 Estimates for height 

The original A. I. Shirshov's proof, being purely combinatorial (it was based on the 
technique of elimination developed by him for Lie algebras, in particular in the proof of 
the theorem on freeness), nevertheless implied only primitively recursive estimates. Later 
A. T. Kolotov [23] obtained an estimate for Ht(A) ^ l l " (n = deg(A), I is the number 
of generators). A. Ya. Belov in |24| has shown that Ht(n, I) < 2nl n+1 . The exponential 
estimate in Shirshov height theorem was also presented in [121 ESI [26j [39] . The above 
estimates were sharpened in the papers by A. Klein [27, 28J. In 2001 Ye. S. Chibrikov 
proved in [2H] that Ht(4, 1) ^ (7k 2 -2k). M. I. Kharitonov in [SEMES] obtained estimates 
for the structure of piecewise periodicity. In 2011 A. A. Lopatin [31] obtained the following 
result: 

Theorem 7 Let C n j be the nilpotency degree of a free l-generated algebra satisfying x n = 
0, and let p be the characteristic of the ground field of the algebra, greater than n/2. Then 

(1) : C n>l < 4 ■ 2 n ' 2 l. 

By definition C n j ^ ty(n,n,l). Observe that for small n the estimate (1) is smaller than 
the estimate ^(n, n, I) established in this paper but for growing n the estimate $?(n, n, I) 
is asymptotically better than (1). 

Ye. I. Zelmanov has put the following question in the Dniester Notebook [32J in 1993: 

Question 1.1 Let F2, m be the free 2-generated associative ring with identity x m = 0. Is 
it true that the nilpotency class of F2, m grows exponentially in ml 

Our paper answers Ye. I. Zelmanov's question as follows: the nilpotency class in 
question grows subexponentially. 

1.8 The results obtained 

The main result of the paper is as follows: 
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Theorem 8 The height of the set of not n-divisible words over an alphabet of cardinality 
I relative to the set of words of length less than n does not exceed I) where 

$(n,Z) = E x l ■ n ^ + 121og 3 n . ? 

Ei = 421 io g3 4+i7 > E2 = 30 i g 3 4 + io. 

This theorem after some coarsening and simplification of the estimate implies that for 
fixed / and n — > 00 we have 

$(n,Z) < 2 87 / • n 121 °S3™+ 48 = n i2(i+o(i))iog s » 
and for fixed n and / — > 00 we have 

$(n,Z) < C(n)Z. 

Corollary 1.4 TTie height of an l-generated Pl-algebra with an admissible polynomial 
identity of degree n over the set of words of length less than n does not exceed $(n, /). 

Moreover we prove a subexponential estimate which is better for small n: 

Theorem 9 The height of the set of not n-divisible words over an alphabet of cardinality 
I relative to the set of words of length less than n does not exceed $(n, I) where 

$(n,Z) = 2 40 /-n 38+81og 2™. 

In particular we obtain subexponential estimates for the nilpotency index of /-generated 
nil-algebras of degree n in an arbitrary characteristic. 
The second main result of our paper is the following 

Theorem 10 Let I, n and d ^ n be positive integers. Then all l-generated words of length 
not less than ^(n,d,l) either contain x d or are n-divisible. Here 

V(n,d,l) = 45+3Iog 3 4 Z ( nd )31og s (nd)+(5+61og34) d 2_ 

This theorem after some coarsening and simplification of the estimate implies that for 
fixed I and nd — > 00 we have 

#(M,Z) < 2 lS l(nd) 3l °^ nd ^ 13 d 2 = ( n d)3(l+o(l))log3(nd) j 

and for fixed n and I — > 00 we have 

V(n,d,l) < C(n,d)l. 

Corollary 1.5 Let I, d be positive integers, and let an associative l-generated algebra A 
satisfy x d = 0. Then its nilpotency index is less than *ff(d, d, I). 

Moreover we prove a subexponential estimate which is better for small n and d: 

Theorem 11 Let I, n and d ^ n be positive integers. Then all l-generated words of length 
not less than fy(n,d,l) either contain x d or are n-divisible. Here 

V(n,d,l) = 25Ql(nd) 2lo ^ nd)+10 d 2 . 
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Notation 1.1 For a real number x put r x~ l := —[—a;]. Thus we replace non-integer num- 
bers by the closest greater integers. 

Proving theorem [8] we also prove the following theorem on estimation of essential height: 

Theorem 12 The essential height of an l-generated PI -algebra with an admissible poly- 
nomial identity of degree n over the set of words of length less than n is less than T(n, I), 
where 

T(n,Z) = 2n 3riog ^ n + 4 /. 

In [33] it is established that the nilpotency index of an /-generated nil-semiring of 
degree n equals the nilpotency index of an /-generated nilring of degree n, where addition 
is not supposed to be commutative. (The paper also contains examples of non-nilpotent 
nil-nearrings of index 2.) Thus our results extend to the case of semirings as well. 

1.9 On estimates from below 

Let us compare the results obtained with the estimate for height from below. The height 
of an algebra A is not less than its Gelfand — Kirillov dimension GK(A). For the algebra 
of /-generated general matrices of order n this dimension equals (/ — l)n 2 + 1 (see |34j as 
well as |35|). At the same time, the minimal degree of an identity in this algebra is 2n by 
Amitsur — Levitsky theorem. We have 

Proposition 1.1 The height of an l-generated Pl-algebra of degree n and of the set of 

not n-divisible words over an alphabet of cardinality I is not less than (/ — l)n 2 /4 + 1. 

Estimates from below for the nilpotency index were established by Ye. N. Kuzmin 
in [36]. He gave an example of 2-generated algebra with identity x n = 0, such that its 
nilpotency index exceeds (n 2 + n — 2)/2. The problem of finding estimates from below is 
considered in [30] . 

At the same time, for zero characteristic and a countable set of generators, Yu. P. Razmyslov 
(see for instance [37] ) obtained an upper estimate for the nilpotency index, namely n 2 . 

First we will prove theorem [TOj and in the following section we will deal with estimates 
for essential height, that is, for the number of distinct periodical pieces in a not n-divisible 
word. 

Acknowledgements. The authors are grateful to V. N. Latyshev, A. V. Mikhalyov 
and all participants of the seminar "Ring theory" for their attention to our work, as well as 
to the participants of the seminar at Moscow Physic-technical institute under the guidance 
of A. M. Raigorodsky. 

2 Estimates on occurence of degrees of subwords 
2.1 The outline of the proof for theorem [TO 

Lemmas 12. 14 12.21 and 12.31 describe sufficient conditions for presence of a period of length d 
in a not n-divisible word W . Lemma T2.4I connects n-divisibility of a word W with the set 
of its tails. Further we choose some specific subset in the set of tails of W, such that we 
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can apply Dilworth theorem. After that we color the tails and their first letters according 
to their location in chains obtained by application of Dilworth theorem. 

We have to know the position in any chain where neighboring tails begin to differ. It 
is of interest what is the "frequency" of this position in a p-tail for some p ^ n. Further 
we somewhat generalize our reasoning dividing tails into segments consisting of several 
letters each and determining the segment containg the position where neighboring tails 
begin to differ. Lemma [3.21 connects the "frequencies" in question for p-tails and fcp-tails 
for k = 3. 

To complete the proof, we construct a hierarchical structure based on lemma I3T2"} that 
is, we consecutively consider segments of n-tails, subsegments of these segments and so 
on. Furthermore we consider the greatest possible number of tails in the subset to which 
Dilworth theorem is applied, and then we estimate from above the total number of tails 
and hence of the letters in the word W. 

2.2 Periodicity and n-divisibility properties 

Let ai, 02, • • • , ai be the alphabet used for constructing words. The ordering a\ -< a 2 -< 
■ ■ ■ -< di induces lexicographical ordering for words over the alphabet. For convenience, 
we introduce the following definitions. 

Onpe,a,ejieHHe 13 a) If a word v includes a subword of the form u l then we say that v 
includes a period of length t. 

b) If a word u is the beginning of a word v then these words are called incomparable. 

c) A word v is a tail of a word u if there exists a word w such that u = wv. 

d) A word v is a fc-tail of a word u if v consists of k first letters of some tail u. 
d*) A A;-beginning is the same as k-tail. 

e) A word u is to the left from a word v if u begins to the left from the beginning of v. 

Notation 2.1 a) For a real number x put r x n := —[—a;]. 
b) Let \u\ denote the length of a word u. 

The proof uses the following sufficient conditions for presence of a period: 

Lemma 2.1 In a word W of length x either first [x/d] tails are pairwise comparable or 
W includes a period of length d. 

^OKa3aTejit.CTBO Suppose W includes no word of the form u d . Consider first [x/d] 
tails. Suppose some two of them, say v± and t> 2 , are incomparable and v± — u • v<i. Then 
v 2 = u ■ v 3 for some V3. Furthermore V\ = u 2 ■ v%. Arguing in this way we obtain that 
v\ = u d ■ Vd+i since \u\ < x/d, | ^2 1 ^ (d — l)x/d. A contradiction. 

Lemma 2.2 If a word V of length k ■ t includes not more than k different subwords of 
length k then V includes a period of length t. 

^OKa3aTejifaCTBO We use induction in k. The base k = 1 is obvious. If there are not 
more than {k — 1) different subwords of length (k — 1) then we apply induction assumption. 
If there exist k different subwords of length (k — 1) then every subword of length k is 
uniquely determined by its first {k — 1) letters. Thus V = v l where v is a k-tail ofV. 



9 



Onpe,n,ejieHHe 14 a) A word W is n-divisible in ordinary sense if there exist u\, 112, . . . ,u n 
such that W = v ■ u\ - ■ ■ u n and u\ y . . . y u n . 

b) In our proof we will call a word W n-divisible in tail sense if there exist tails 
U\, . . . , u n such that U\ y U2 y ■ ■ ■ y u n and for any i — 1, 2, . . . , n — 1 the beginning of 
Ui is to the left from the beginning of Wj+i- If the contrary is not specified, an n-divisible 
word means n-divisible in tail sense. 

c) A word W is n- cancellable if either it is n-divisible in ordinary sense or there exists 
a word of the form u d C W .. 

Now we describe a sufficient condition for n-cancellability and its connection with 
n-divisibility. 

Lemma 2.3 If a word W includes n identical disjoint subwords u of length n ■ d then W 
is n- cancellable. 

,ZLoKa3aTejii>CTBO Suppose the contrary. Consider the tails u%, U2, ■ ■ ■ , u n of the word 
u which begin from each of the first n letters of u. Renumerate the tails to provide the 
inequalities u\ y . . . y u n . By lemma 1 the tails are incomparable. Consider the subword 
u\ in the left-most copy of u, the subword U2 in the second copy from the left, . . . , u n in 
the nth copy from the left. We get an n-division ofW . A contradiction. 

Lemma 2.4 If a word W is 4nd- divisible then it is n- cancellable. 

^OKasaTejibCTBO Suppose the contrary. Consider the numbers of positions of letters 
a i; ai < ct2 < • • • < a^nd, that begin the tails Ui dividing W. Set a 4n ^ +1 = \W\. If 
W is not n-cancellable then there exists i, 1 ^ i ^ 4(n — l)d + 1, such that for any 
i^b<c^d<e^i + 4d the (a c — ab)-tail Ub is incomparable with the (a e — ad) -tail Ud- 
Compare — a-i and ai + 4d — &i+2d- We may assume that 0^+4^ — di+2cz ^ o«+2d — Oj. Let 
a,j + \ — aj = inf (at+i — ctfc), ^ j < 2d. We may assume that j < d. By assumption the 

k 

(a2d — a, j) -tail Uj and the (a2d — aj + i)-tail Uj+i are incomparable with the (a^ — a2d)-tail 
u 2 d- Since a 4( ^ — a2d ^ Ci2d — a j > ^d ~ a j+i, the (a 2 d — aj)-tail Uj and the (a2d — aj + i)-tail 
Uj + i are mutually incomparable. Since ~~~ ^ the (a J+ i — a j) -tail Uj in degree d 
is included into the (a2<i — aj)-tail Uj. A contradiction. 

Corollary 2.1 If a word W is not n-divisible in ordinary sense then W is not 4nd- 
divisible (in tail sense). 

Notation 2.2 Set p n>d := 4n<i - 1. 

Let W be a not n-cancellable word. Consider U, the [\W\ /<i]-tail of W. Then W is 
not {p n> d + Indivisible. Let Q be the set of tails of W, which begin in U. Then by lemma 
12.11 any two elements of Q are comparable. There is a natural bijection between O, the 
letters of U and positive integers from 1 to |0| = \U\. 

Let us introduce a word 9 which is lexicographically less than any other word. 

Remark 2.1 In the current proof of theorem\T(j\ all tails are assumed to belong to Vt. 
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3 Estimates on occurence of periodical fragments 



An application of Dilworth theorem. For tails u and v put u < v if u -< v and 

u is to the left from v. Then by Dilworth theorem, Q can be divided into p n( i chains 
such that in each chain u -< v if u is to the left from v. Paint the initial positions of the 
tails into p n ^ colors according to their occurence in chains. Fix a positive integer p. To 
each positive integer i from 1 to \Q\, attach B p (i), an ordered set of p n ^ words {f{i,j)} 
constructed as follows: 

For each j = 1,2,..., p n4 put 

f(i,j) = {max / ^ i : f is painted into color j} . 

If there is no such f then the word from B p (i) at position j is assumed to be equal to 
9, otherwise to the p-tail that begins from the f(i,j)-th letter. 

Informally speaking, we observe the speed of "evolution" of tails in their chains when 
the sequence of positions in W is considered as the time axis. 

3.1 The sets B p (i) and the process at positions 

Lemma 3.1 (On the process) Given a sequence S of length \S\ consisting of words of 
length (k — 1). Each word consists of (k — 2) symbols "0" and a single symbol "1". Let S 
satisfy the following condition: 

If for some < s ^ k — 1 there exist p n ^ words such that "1" occupies the sth position 
then between the first and the p n ^-th of these words there exists a word such that "1" 
occupies a position with number strictly less than s; L(k — 1) = sup \S\. 

ThenL(k-l)^p k J-l. 

^OKa3aTejn>CTBO We have L(l) ^ p n ^ — 1. Let L(k — 1) ^ p k ^ — 1. We will show 
that L(k) ^ p k ld — 1. Consider the words such that "1" occupies the first position. Their 
number does not exceed p n ^ — 1 . Between any two of them as well as before the first one 
and after the last one, the number of words does not exceed L{k — 1) ^ p k ~^ — 1. Hence 

L(k) ^ p n4 - 1 + (p nA ) \ (Pn,d) k ~~ 1 ~ l) = (Pn,df ~ !> 

as required. 

We need a quantity which estimates the speed of "evolution" of sets B p (i): 
Onpe,n,ejieHHe 15 Set 

i/j(p) := {max k : B p (i) = B p (i + k - 1)} . 

In particular, by lemma l272\ we have ip(p n ,d) ^ p n ,dd. 

For a given a we divide the sequence of the first \Q\ positions iofW into equivalence 
classes ~ a as follows: i ~ a j if B a {i) = B a (j). 

Proposition 3.1 For any positive integers a < b we have ip(a) ^ ip{b). 
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Lemma 3.2 (basic) For any positive integers a, k we have 

■0(a) ^ p k n d ip(k ■ a) + k ■ a 

^OKa3aTejiBCTBO Consider the least representative in each class of ~fe. a . We get a 
sequence of positions {ij}. Now consider all ij and B k ' a (ij) from the same equivalence 
class o/~ a . Suppose it consists of B k ' a {ij) for ij G [b, c). Let {ij}' denote the segment of 
the sequence {ij}, such that ij G [b,c — k ■ a). 

Fix a positive integer r, 1 ^ r ^ p n ,d- All k ■ a-beginnings of color r that begin from 
positions of the word W in {ij}' will be called representatives of type r. All representatives 
of type r are pairwise distinct because they begin from the least positions in equivalence 
classes of ~fc. a . Divide each representative of type r into k segments of length a. Enu- 
merate segments inside each representative of type r from left to right by integers from 
zero to (k — 1) . // there exist (p n< d + 1) representatives of type r with the same first (t — 1) 
segments but with pairwise different tth segments where 1 ^ t ^ k — 1 then there are two 
tth segments such that their first letters are of the same color. Then the initial positions 
of these segments belong to different equivalence classes of ~ a . 

Now apply lemma lBTJl as follows: in all representatives of type r except the rightmost 
one we consider a segment as a unit segment if it contains the least position where this 
representative of type r differs from the preceding one. All other segments are considered 
as zero segments. 

Now we apply the process lemma for the values of parameters as given in the condition 
of the lemma. We obtain that the sequence {ij}' contains not more than p k ~ d x repre- 
sentatives of type r. Then the sequence {ij}' contains not more than p k ld terms. Thus 
c — b ^ p k n d ip{k ■ a) + k ■ a. 

3.2 Completion of the proofs for theorems [10] and [TT] 

Let 

ao = 3 ri °S3P»,<T ai = 3 no g3P „^-i ; ? ar log3Pn ^ = 1. 

Then \W\ ^ d |Q| + d by lemma O 

Since for the set B 1 {i) not more than (1 + p n ^l) different values are possible, we have 
\W\ ^ d(l+p n , d l)ip(l)+d. 

By lemma I3.2I 

0(1) < {pl 4 +P M M3) < ( P l d + Pn,d) 2 ^) < ■ < (pl d +Pn,d) rlOS3Pn ' d MPn4) ^ 

Take p n ^ = ^nd — 1 , to get 

\W\ < 4 5+31o g3 4 /( n ^31og3(nd)+(5+61og 3 4)^2_ 

This implies the assertion of Theorem 1101 

The proof of theorem [TT] is completed similarly but instead of the sequence 

~ _ Q r log 3 p„, d n _ o r log 3 p nid n -l 1 
Uq — O , Ui — O , . . . , « r log 3 p n!d n — 1 

we have to consider the sequence 

ao = 2 r{o ^\ a x = 2 rio ^^ n -\ . . . , ar log2Pn d , = 1. 
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4 An estimate for the essential height. 



In this section we proceed with the proof of the main theorem [SJ In passing, we prove 
theorem [T2J We consider positions of letters in the word W as the time axis. That is, a 
subword u occurs before a subword v if u is entirely to the left from v in W. 

4.1 Isolation of distinct periodical fragments in the word W 

Let s denote the number of subwords in W such that each of them includes a period of 
length less than n more than 2n times and each pair of them is separated by subwords of 
length greater than n, comparable with the preceding period. Enumerate these from the 
beginning to the end of the word: xf 1 , x 2 n , ■ ■ ■ , x 2n . Thus W = yoxf n yix 2 n ■ ■ ■ x 2 ™y s . 

If there is i such that the word X{ has length not less than n then the word xj includes 
n pairwise comparable tails, hence the word xf n is n-divisible. Then s is not less than the 
essential height of W over the set of words of length less than n. 

Onpe,n,ejieHHe 16 A word u will be called non-cyclic if u is not representable in the form 
v k where k > 1. 

Onpe/jejieHHe 17 A word cycle u is the set consisting of the word u and all its cyclis 
shifts. 

Onpe/jejieHHe 18 A word W is strongly n-divisible if it is representable in the form 
W = WqW\ ■ ■ ■ W n where the subwords W%, . . . , W n are placed in the lexicographically 
decreasing order and each of W{, % — 1, 2, . . . , n begins from some word z\ G Z where all 
Z{ are distinct. 

Lemma 4.1 If there is an integer m, 1 ^ m < n, such that there exist (2n — 1) pairwise 
incomparable words of length then W is n-divisible. 

^OKa3aTejifaCTBO Put x := x^. Then W includes disjoint subwords x pi v[, . . . , x P2n ~ 1 v' 2n _ 
where pi, . . . ,P2n-i are positive integers greater than n, and v[, . . . ,v' 2n _ l are words of 
length m comparable with x,v[ = v^. Hence among the words v[, . . . ,v' 2n _ 1 either there 
are n words lexicographically greater than x or there are n words lexicographically smaller 
than x. We may assume that v[,...,v' n are lexicographically greater than x. Then W 
includes subwords v[,xv' 2 , ■ ■ ■ 1 x n ~ 1 v' n) which lexicographically decrease from left to right. 

Consider an integer m, 1 ^ m < n. Divide all Xi of length m into equivalence classes 
relative to strong incomparability and choose a single representative from each class. Let 
these be , . . . , Xy , where s' is a positive integer. Since the subwords Xi are periods, we 
consider them as word cycles. 

Notation 4.1 Vk := Xi k 

Let v(k, i) where i is a positive integer, 1 < i < m, be a cyclic shift of a word Vk by 
(k — 1) positions to the right, that is, v(k,l) = and the first letter of v(k, 2) is the 
second letter of v^. Thus is a word cycle ofvk- Note that for any 1 ^ H,i2 ^ 

p, 1 ^ ji,j2 ^ m the word v (ii, ji) is strongly incomparable with v(i 2 ,j 2 ). 

Remark 4.1 The cases m = 2, 3, n — 1 were considered in f3fy \2b\ ,39]. 
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4.2 An application of Dilworth theorem 

Consider a set Q' = {v(i,j)}, where 1 ^ i ^ p, 1 ^ j ^ m. Order the words v(i,j) as 
follows: 

v(h,ji) y v(i 2 ,j 2 ) if 

1) v(i u ji) > v(i 2 ,j 2 ) 

2) ii > i 2 . 

Lemma 4.2 // in the set Q' with ordering >- there exists an antichain of length n then 
W is n-divisible. 

^OKasaTejifaCTBO Suppose there exists an antichain of length n consisting of words 
v {h, ji),v(i 2 , j 2 ), . . . ,v(i n , j n ) ; here %\ ^ i 2 ^ • • • ^ i n . If all inequalities between ij. are 
strict then W is n-divisible by definition. 

Suppose that for some r there exist i r+ \ = ■ ■ ■ = i r+k such that either r = ori r < i r +i- 
Moreover the positive integer k is such that either k = n — r or i r+ ^ < i r +k+i- 

The word Si r+1 is periodical, hence it is representable as a product of n copies ofv^ . 
The word v\ includes a word cycle Vi r+1 . Hence in Si r+1 there exist disjoint subwords 
placed in lexicographically decreasing order and equal to v (i r +i, jV+i)> • • • ,v(i r +k, jr+k) re- 
spectively. Similarly we deal with all sets of equal indices in the sequence {i r }" =1 . The 
result is n- divisibility ofW. A contradiction. 

Thus Of can be divided into (n — 1) chains. 

Notation 4.2 Put q n — (n — 1). 

4.3 The sets C a (i), the process at positions 

Paint the first letters of the words from Q' into q n colors according to their occurence in 
chains. Paint also the integers from 1 to into the corresponding colors. Fix a positive 
integer a ^ m. To each integer i from 1 to attach an ordered set C a (i) of q n words 
in the following way: 

For each j = 1, 2, . . . , q n put f{i,j) = {max / ^ i : there exists k such that v(f, k) is 
painted into color j and the a-tail beginning from f consists only of letters initial in some 
tails from Q'}. 

If there is no such f then a word from C a (i) is assumed to be equal to 9, otherwise we 
assume it to be equal to the a-tail of v(f, k). 

Notation 4.3 Set 0(a) = {max k : for some i we have C a (i) = C a (i + k — 1)}. 

For a given a ^ m define a division of the sequence of word cycles {i} in W into 
equivalence classes as follows: i ~ a j if C a (i) = C a (j). 

Note that the above construction is rather similar to the construction from the proof 
of theorem IT01 Observe that B a (i) and C a (i) are rather similar as well as ip{a) and 0(a). 

Lemma 4.3 0(m) ^ qw/m. 
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^OKa3aTejn>CTBO In notation 4-1 we have enumerated word cycles. Consider the word 
cycles with numbers i, % + 1, . . . , i + [q n /m]. We have shown that each word cycle consists 
of m distinct words. Now consider words in the word cycles i,i + 1, . . . ,i + [q n /m} as 
elements of the set Q'. Then the first letter in each word cycle gets some position. The 
total number of the positions in question is not less than n. Hence at least two of these 
positions are of the same color. Now strong incomparability of word cycles implies the 
assertion of the lemma. 

Proposition 4.1 For any positive integers a < b we have 0(a) ^ 4>{b). 
Lemma 4.4 (Basic) For positive integers a, k such that ak ^ m we have 

M < Pl,d4>{k ■ a) 

,ZLoKa3aTejifaCTBO Consider the minimal representative in each class of ~fe. a . We get 
a sequence of positions {ij}. Now consider all ij and C k ' a (ij) from the same equivalence 
class o/~ a . Suppose it consists of C k a (ij) for ij G [b, c). Let {ij}' denote the segment of 
the sequence {ij}, such that ij G [b,c). 

Fix a positive integer r, 1 ^ r ^ q n . All k ■ a-beginnings of color r that begin from 
positions ofW in {ij}' will be called representatives of type r. All representatives of type r 
are distinct because they begin at the least positions in equivalence classes of ~ k . a . Divide 
each representative of type r into k segments of length a. Enumerate the segments of 
each representative of type r from left to right by integers from zero to (k — 1). // there 
exist (q n + 1) representatives of type r with the same first (t — 1) segments but pairwise 
different tth segments where 1 ^ t ^ k — 1 then there are two tth segments such that their 
first letters are of the same color. Then the initial positions of these segments belong to 
different equivalence classes of ~ a . 

Now apply lemma \3.1\ in the following way: in all representatives of type r except the 
rightmost one we consider a segment as a unit segment if it contains the least position 
where this representative of type r differs from the preceding one. All other segments are 
considered as zero segments. 

Now we can apply the process lemma for the values of parameters as given in the 
condition of the lemma. We obtain that the sequence {ij}' contains not more than q k_1 
representatives of type r. Then the sequence {ij}' contains not more than q k terms. Thus 
c - b ^ q k J)ik ■ a). 

4.4 Completion of the proof for theorem [T2 

Suppose 

Go = 3^..^ Gl = 3 nog 3Pni ^- 1; ^ ar log3Pn d , = 1. 
Substitute these a« into lemmas 14.41 and 14.31 to obtain 

0(1) *C g^0(3) ^ ^,0(9) ^ • • < gr° g3m >M ^ 

< rt 3 r log 3 m n +l 
\ Hn 

Since G\ takes not more than 1 + q n l distinct values, we have 

< log3mn+1 (l + qj) < n 3riog 3" n+2 /. 

By virtue of lemma 14. II the number of subwords Xi of length m is less than 2n 31og3 ™ n+3 /. 
Thus the total number of subwords Xi is less than 2n 3riog3 ™ n+4 /. So s < 2n 3riog3 ™ n + 4 / and 
theorem [T2l is proven. 
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5 Proof of the main theorem 
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and of theorem 
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5.1 Outline of the proof 

Now an n-divisible word will mean a word n-divisible in ordinary sense. To start with, we 
find the necessary number of fragments in W with length of the period not less than 2n. 
For this, it suffices to divide W into subwords of large length and to apply theorem [10] 
to them. However the estimate can be improved. For this, we find a periodical fragment 
U\ in W with the period length not less than 4n. Removing Ui, we obtain a word W\. 
In W\ we find a fragment u 2 with period length not less than 4n and remove it to get 
a word W 2 - Now we again remove a periodical fragment and proceed in this way, as is 
described in the algorithm 15. II in more detail. Then we restore the original word W using 
the removed fragments. Further we show that a subword Ui in W usually is not a product 
of a big number of not neighboring subwords. In lemma 15.11 we prove that application 
of algorithm 15.11 enables to find the necessary number of removed subwords of W with 
period length not less than 2n. 

5.2 Summing up of essential heights and nilpotency degrees 

Notation 5.1 Let Ht(u') denote the height of a word w over the set of words of degree 
not exceeding n. 

Consider a word W of height Ht(W) > $(n, I). Apply the following algorithm to it: 
Algorithm 5.1 



Step 1. By theorem ITU the word W includes a subword with period length An. Suppose 
Wq = W = u' l xf?y[ where the word Xy is not cyclic. Represent y[ in the form 
y[ = x^jyi where r 2 is maximal possible. Represent u[ as u[ = UiX^} where r\ is 
maximal possible. Denote by f\ the word 

W = Ul xt? +r ^ yi = uxhyt. 

In the sequel, the positions contained in fi are called tedious, the last position of U\ 
is called tedious of type 1, the second position from the end in u\ is called tedious 
of type 2, the nth position from the end in Ui is called tedious of type n. Put 
Wi = Uxyx. 

Step k. Consider the words Uk-i, Vk-i, Wk-i = Uk-iyu-i constructed at the preceding 
step. If |Wfe_i| ^ then we apply theorem[Ws to W with the restriction that 

the process in the main lemma HOI is applied only to non-tedious positions and to 
tedious positions of type greater than ka where k and a are the parameters from 
lemma \3.°A 

Thus Wk-i includes a non-cyclic subword with period length 4n such that 

W k _ x = u' k xpy' k . 

Then put 

ri := sup{r : u' k = u k x r k ,}, r 2 := sup{r : y' k = x r k ,y k }. 
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(Note that the words involved may be empty.) 
Define f k by the equation 

W k -i = u k xp +ri+V2 y k = u k f k y k . 

In the sequel, the positions contained in f k are called tedious, the last position in u k 
is tedious of type 1, the second position from the end in u k is tedious of type 2, 
the nth position from the end in u k is tedious of type n. If a position occurs to be 
tedious of two types then the lesser type is chosen to it. Put W k = u k y k . 

Notation 5.2 Perform 4t + 1 steps of the algorithm \5.1\ and consider the original word 
W. For each integer i from the segment [l,4t] we have 

W = w fVw 1 f? ) ---f^w ni 

for some subwords Wj. Here fi = fj^ ■ ■ ■ fj; . Moreover we assume that for 1 ^ j ^ rij — 1 
the subwordwj is not empty. Let s(k) be the number of indices i G [l,4t] such that Hi = k. 

To prove theorem [101 we have to find as many long periodic fragments as possible. 
For this, we can use the following lemma: 

Lemma 5.1 s = s(l) + s(2) ^ 2t. 

^OKa3aTejit.CTBO A subword U of the word W will be called monolithic if 

1. U is a product of words of the form f^\ 

2. U is not a proper subword of a word which satisfies the above condition (1). 

Suppose that after the (i — l)th step of the algorithm 15.11 the word W includes 
monolithic subwords. Note that hi ^ — rij + 2. 

Then if Hi ^ 3 then ki ^ ki-\ — 1. If ni ^ 2 then ki ^ ki-\ + 1. Furthermore k\ = 1, 
kt ^ 1 = k\. The lemma is proven. 

Corollary 5.1 

oo 

^k ■ s(k) < lOt ^ 5s. (gZEJ) 

k=l 

,ZLoKa3aTejibCTBO From the proof of lemma \5~l\ we obtain ^2 (rij — 2) ^ 2t. 

oo oo 

By definition s(k) = At, m.e. 2s(fc) = 8t. 

k=l k=l 

Summing up these two inequalities and applying lemma 15.11 we obtain the required 
inequality \5.1[ 

Proposition 5.1 The height ofW does not exceed 

oo 

*(n, 4n, I) + k ■ s(k) < *(n, in, I) + 5s. 

k=l 

In the sequel, we consider only /, with n, ^ 2. 
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Notation 5.3 If n% = 1 then put f[ := f%. 

If rii = 2 then put f[ := f-^ where f-^ is the word of maximal length between and 

/f. 

Order the words f[ according to their distance from the beginning of W . We get a 
sequence f mi ,...,f ms where s' = s(l) + s(2). Put /f := f m .. Suppose ft' = w-x^'w'- 
where at least one of the words w\,w'l is empty. 

Remark 5.1 We may assume that at starting steps of alaorithm \5.1\ we have chosen all 
fi such that Ui = 1 . 

Now consider z'-, the subwords in W of the following form: 

Zj = X (2j-1)" > °> N = \ X (2j-l)"h 

here Vj is not equal to x^j-i)", and the beginning of z'j coincides with the beginning of a 
periodic subword in f^j-i- We will show that z'j are disjoint. 
Indeed, if ft._ x = / m2j ._ 1 then z'j = fm^Vj. 

If fUj-i = fmlj-i, k = 1,2, and z'j intersects z'j +l then C ^. Since £(2j)" and X(2 3 -i)" 
are not cyclic, we have |x(2j)"| = \x(2j-iy |- But then the period length in z'j is not less 
than An, a contradiction with remark 15.11 

Thus we have proved the following lemma: 

Lemma 5.2 In a word W with height not greater than {ty{n,An,l) + 5s') there exist at 
least s' disjoint periodic subwords such that the period occurs in each of them at least 2n 
times. Furthermore between any two elements of this set of periodic subwords there is a 
subword with the same period length as the leftmost of these two elements. 

5.3 Completion of the proof for the main theorem [8] and for the- 
orem [9] 

Replace s' in lemma 15.21 by s from the proof of theorem [12] to obtain that the height of 
W does not exceed 

#(71,4/1,0 + 5S < E x l ■ n ^+121og 3 n . ? 

where E l = 4 211og s 4+17 , E 2 = 301og 3 4 + 10. 

Thus we have obtained the assertion of the main theorem |8j 

Proof of theorem [9] is completed similarly but we have to replace in part 14.41 the 

sequence 

~ _ Q r log 3 p„, d n _ o r log 3 p nid n -l 1 

"0 — ° ) "1 — ° 5 • • • 5 Ur log 3 p n ^ — 1 

by the sequence 

ao = 2 r iog 2 P., d ; ai = 2 r iog 2P „, d -i ; ? ar log2Pn d , = 1, 
and to take the value of #(n, An, I) from theorem [TT1 
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6 Comments 



The technique presented to the reader appears to enable improvement of the estimate 
obtained in this paper. However this estimate will remain subexponential. A polynomial 
estimate if it exists, requires new ideas and methods. 

At the beginning of the solution presented, subwords of a large word in the application 
of Shirshov theorem are used mainly as a set of independent elements, not as a set of 
closely related words. Further we use a coloring of letters inside subwords. Account of 
coloring of first letters only leads to an exponential estimate. Account of coloring of all 
letters in the subwords results in an exponent as well. This fact is due to constructing of 
a hierarchical system of subwords. A detailed investigation of the presented connection 
between subwords together with the solution presented above may improve the presented 
estimate up to a polynomial one. 

It is also of interest to obtain estimates for height of an algebra over the set of words 
whose degrees do not exceed the complexity of the algebra (Pi-degree in literature in 
English). The paper [3] presents exponential estimates, and for words that are not a linear 
combination of lexicographically smaller words, overexponential estimates were obtained 
in [38]. 

The deep ideas of original works by A. I. Shirshov jU [2] which stem from elimination 
technique in Lie algebras, may be highly useful, among other issues, for improvement of 
estimates, despite the fact that the estimates for height in these papers are only primitively 
recursive. 
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